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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Schmidt, Eduard Daniel Leendert 
De Vries, Sape Cornelis 
Hecht, Valerie France Gabrielle 

(ii) TITLE OF INVENTION: Apomixis Conferred By Expression of SERK 

Interacting Proteins 

(iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Syngenta Patent Dept . 

(B) STREET: 3 054 Cornwallis Road 

(C) CITY: RTP 

(D) STATE: NC 

(E) COUNTRY: USA 

(F) ZIP: 27709 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS -DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT/EP99/ 07 972 

(B) FILING DATE: 20-OCT-1999 

( C ) C LAS S I F I CAT I ON : 

(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Meigs , J. Timothy 

(B) REGISTRATION NUMBER: 3 8,241 

(C) REFERENCE/DOCKET NUMBER: S-30683A 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 919-541-8587 



(2) INFORMATION FOR SBQ ID NO: 1: 

(i) SEQUENCE CHARACIERISTICS : 

(A) LENGTH: 551 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
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(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis tJialiana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3A35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

AOGTGTCOCT QGAGGCQGGT OGGGTCAGTC GGCTCAGATA CCAAGGTGCC AAGIGGAAGG 60 

TTGTOGGATG GATCIAACCA ATGCAAAAGG TTATTACTOG AGACACCGAG TITGTGGAGr 120 

GCACrCTAAA ACACCTAAAG TCACIGTGGC TOGIATCGAA CAGAGGlTiT GTCAACAGIG 180 

CAGCAGGITT CATCAGCTTC OGGAATTTGA CCTAGAGAAA AGGAGITGCC GCAGGAGACT 240 

OGCrOGTCAT AATGAGGGAC GAAGGAAGCC ACAGCCTGOG TCTCrCrCTG TGrTAGCTTC 300 

TOGITAOGGG AGGATOGCAC CTTCGCITrA OGAAAATGGI GATGCIGGAA TGAATGGAAG 360 

CnrCnGGG AACCAAGAGA TAGGATQGCC AAGITCAAGA ACATTOGATA CAAGAGTGAT 420 

GAGGOQGCCA GTGTCATCAC OCTCATGGCA GATCAATCCA ATGAATCTAT TIAGTCAAGG 480 

TrCAGirOGI QGAGGAAGGA CAAGCITCrC ATCTCCAGAG ATTATQGACA CTAAACTAGA 540 

GAGCTACAAG G 551 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI -SENSE: NO 

(vi) ORIGINAL- SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: 3A35 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 2: 

Met Glu Met Gly Ser Asn Ser Gly Pro Gly His Gly Pro Gly Gin Ala 
15 10 15 

Glu Ser Gly Gly Ser Ser Thr Glu Ser Ser Ser Phe Ser Gly Gly Leu 
20 25 30 

Met Phe Gly Gin Lys He Tyr Phe Glu Asp Gly Gly Gly Gly Ser Gly 
35 40 45 

Ser Ser Ser Ser Gly Gly Arg Ser Asn Arg Arg Val Arg Gly Gly Gly 
50 55 60 

Ser Gly Gin Ser Gly Gin He Pro Arg Cys Gin Val Glu Gly Cys Gly 
65 70 75 80 

Met Asp Leu Ihr Asn Ala Lys Gly Tyr Tyr Ser Arg His Arg Val Cys 
85 90 95 

Gly Val His Ser Lys Thr Pro Lys Val Thtr Val Ala Gly He Glu Gin 
100 105 110 

Arg Phe Cys Gin Gin Cys Ser Arg Phe His Gin Leu Pro Glu Phe Asp 
115 120 125 

Leu Glu Lys Arg Ser Cys Arg Arg Arg Leu Ala Gly His Asn Glu Arg 
130 135 140 

Arg Arg Lys Pro Gin Pro Ala Ser Leu Ser Val Leu Ala Ser Arg Tyr 
145 150 155 160 

Gly Arg He Ala Pro Ser Leu Tyr Glu Asn Gly Asp Ala Gly Met Asn 
165 170 175 

Gly Ser Phe Leu Gly Asn Gin Glu He Gly Trp Pro Ser Ser Arg Thr 
180 185 190 

Leu Asp Thr Arg Val Met Arg Arg Pro Val Ser Ser Pro Ser Trp Gin 
195 200 205 

He Asn Pro Met Asn Val Phe Ser Gin Gly Ser Val Gly Gly Gly Arg 
210 215 220 



Thtr Ser Phe Ser Ser Pro Glu He Met Asp Thr Lys Leu Glu Ser Tyr 
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225 230 235 240 

Lys Gly lie Gly Asp Ser Asn Cys Ala Leu Ser Leu Leu Ser Asn Pro 
245 250 255 

His Gin Pro His Asp Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn 
260 265 270 

Asn Asn Asn Thr Trp Arg Ala Ser Ser Gly Phe Gly Pro Met Thr Val 
275 280 285 

Thr Met Ala Gin Pro Pro Pro Ala Pro Ser Gin His Gin Tyr Leu Asn 
290 295 300 

Pro Pro Trp Val Phe Lys Asp Asn Asp Asn Asp Met Ser Pro Val Leu 
305 310 315 320 

Asn Leu Gly Arg Tyr Thr Glu Pro Asp Asn Cys Gin lie Ser Ser Gly 
325 330 335 

Thr Ala Met Gly Glu Phe Glu Leu Ser Asp His His His Gin Ser Arg 
340 345 350 

Arg Gin Tyr Met Glu Asp Glu Asn Thr Arg Ala Tyr Asp Ser Ser Ser 
355 360 365 

His His Thr Asn Trp Ser Leu 
370 375 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SECUENCE CHARACTERISTICS: 

(A) LENGTH: 859 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3B39 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

TCAACATTGC TTCCTAACCA GAAATCGACC ATCMCTTCC CAOGAATACA ACITAAAGCr 60 

TTACCAGAAA ATQGAQQGrC AGAGAACACA ACGCCGGGCT TACITGAAAG ACAAGGCTAC 120 

ACTCrCCAAC CITGITGAAG AAGAAATQGA GAATGGCATG GATGGAGAAG AGGAGGATX3G 180 

AGGAGACGAA GACAAAAGGA AGAAGCTGAT GGAAAGAGTT AGAGCJTCCTA GCACTGACOG 240 

TCTTCCATOG a^CTCTGCC AGGTOGATAG GTGCACrGTT AATTIGACrG AGGCCAAGCA 300 

GTATTACCGC AGACACAGAG TATGTGAACT ACATGCA/^ GCATCTGCTG GGACIGITGC 360 

AGGQCTCAGG CAACGCTiTl" GTCAACAATG CAGCAGGnT CATGAGCTAC CAGAGnTGA 420 

TGAAGCTAAA AGAAGCTGCA GGAGGGGCIT AGCTOGACAC AATGAGAGGA QGAGGAAGAT 480 

CrCrOCTGAC ACTTITGGAG AAGGCTCAGG CGGGAGAGGG TrrAGGQGrC AACTGATCCA 540 

GACrCAAGAA AGAAACAGGG TAGACAGGAA ACirCCTATG ACCAACTCAT CATTTAAGGG 600 

ACCACAGATC AGATAAACCC TCCOGCTCTC TCTCITCrGr CATCTACATA TGCTCTATCT 660 

ACACrCITAT TAGACAAATA ATOGCATCTA ACAATCTCAA GAAAAGITGG TCATGGTATT 720 

AAATCCTAGA GGGAAATATA ACTATAAACC TTTAGrCCCC TITATCCrGT CCICTAATGA 780 

ATATCTATCC GGAAATGTAT TOGCATACTC TK^CGICIAA TAATGnTAT TAAAAAAAAA 840 

AAAAAAAAAA AAAAAAAAA 859 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) liENGTH: 181 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: ^ 

(A) ORGANISM: Arabidopsis thaliana 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: 3B39 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Glu Gly Gin Airg Thr Gin Arg Arg Gly Tyr Leu Lys Asp Lys Ala 
15 10 15 

Thr Val Ser Asn Leu Val Glu Glu Glu Met Glu Asn Gly Met Asp Gly 
20 25 30 

Glu Glu Glu Asp Gly Gly Asp Glu Asp Lys Arg Lys Lys Val Met Glu 
35 40 45 

Arg Val Arg Gly Pro Ser Thr Asp Arg Val Pro Ser Arg Leu Cys Gin 
50 55 60 

Val Asp Arg Cys Thr Val Asn Leu Thr Glu Ala Lys Gin Tyr Tyr Arg 
65 70 75 80 

Arg His Arg Val Cys Glu Val His Ala Lys Ala Ser Ala Ala Thr Val 
85 90 95 

Ala Gly Val Arg Gin Arg Phe Cys Gin Gin Cys Ser Arg Phe His Glu 
100 105 110 

Leu Pro Glu Phe Asp Glu Ala Lys Arg Ser Cys Arg Arg Arg Leu Ala 
115 120 125 

Gly His Asn Glu Arg Arg Arg Lys lie Ser Gly Asp Ser Phe Gly Glu 
130 135 140 

Gly Ser Gly Arg Arg Gly Phe Ser Gly Gin Leu lie Gin Thr Gin Glu 
145 150 155 160 

Arg Asn Arg Val Asp Arg Lys Leu Pro Met Thr Asn Ser Ser Phe Lys 
165 170 175 

Gly Pro Gin lie Arg 
180 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACIERISTICS : 

(A) LENGTH: 479 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(D) TOPOLOGY: linear 
(ii) IVDIiECULiE TYPE: cDNA to niRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidcpsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4B19 

(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 5: 

AGAAGCAGAA AGGIAAAGCT ACAACTAGTA GrGGAGITIG TCAGGTCGAG AGTICTACOG 60 

OGGATATGAG CAAAGCCAAA CAGTACCACA AACGACACAA AGTCTGCCAG TTTCATGCCA 120 

AAGCTCCrCA TGirOOGATC TCrGGTCTTC ACCAAOGITT CIGCCAACAA TGCAGCAGGI 180 

TTCACGOGCr CAGTGAGnT GATGAAGCCA AGOQGAGriG CAGGAGAGGC TTAGCIGGAC 240 

ACAAGGAGAG AAGGGOGAAA AGCACAACIG ACTAAAGAOG GIGAAACGIG TGAGATCCGG 300 

GITTGAAGGr TAATGAAACA GGCmGCIT ACTCrCITCr GTCAGrCTCT TTTAGCTCCr 360 

TGIAATCCTC TGIGTCrCTG TCTGriTCrC CATATIACCT GIAATCAAAG CTATCIGCTA 420 

AACCTAOGAC ATOGTEAAAT AAATGCATIG AGACITAAAA AAAAAAAAAA AAAAAAAAA 479 
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidcpsis thaliana 
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(vii) IMMEDIATE SOURCE: 
(B) CLONE: 4B19 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ser Met Arg Arg Ser Lys Ala Glu Gly Lys Airg Ser Leu Arg Glu 
15 10 15 

Leu Ser Glu Glu Glu Glu Glu Glu Glu Glu Thr Glu Asp Glu Asp Thr 
20 25 30 

Phe Glu Glu Glu Glu Ala Leu Glu Lys Lys Gin Lys Gly Lys Ala Thr 
35 40 45 

Ser Ser Ser Gly Val Cys Gin Val Glu Ser Cys Thr Ala Asp Met Ser 
50 55 60 

Lys Ala Lys Gin Tyr His Lys Arg His Lys Val Cys Gin Phe His Ala 
65 70 75 80 

Lys Ala Pro His Val Arg lie Ser Gly Leu His Gin Arg Phe Cys Gin 
85 90 95 

Gin Cys Ser Arg Phe His Ala Leu Ser Glu Phe Asp Glu Ala Lys Arg 
100 105 110 

Ser Cys Arg Arg Arg Leu Ala Gly His Asn Glu Arg Arg Arg Lys Ser 
115 120 125 

Thr Thr Asp 
130 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2682 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Arabidopsis t±ialiana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3A52 

(xi) SEQUENCE DESGRIPTION: SBQ ID NO: 7: 

GCCATTCAAG GAGACACTAA TGCTGCICIT ACITTGAATC TTAATGGTGA AAGTGATGGC 60 

cirrrrcciG ccaagaagac caaatcggga gcgctttgtc agctcgaaaa ctgtgaagct 120 

GATCTTACTA AAGITAAGGA TIATCATAGA CGCCATAAGG TCTCTGAGAT GCATICCAAG 180 

GCTACrAGTG CCACIGTCQG AGCTATCITG CAGCGClTiT GTCAGCAATG TACTAGCTTC 240 

CATCirCTGC CMGirrGGA TGAa3GAAAG AGAAGITCTC CTAGACXJnT GGCTOGCCAT 300 

AATAAAGCTC OGAGGAAAAC AAATCCOGAA CCIGGOGCrA AOQGGAATCC TAGTGATGAT 360 

CACTCAAGGA ACIATCrCIT GATTACICrC TTGAAGATAC TCPCCAATAT GCATAACCAT 420 

ACn3GTGATC AAGATITGAT GrCTCATCTT CTGAAGAGCC TCGTAAGCCA TGCTGGCGAA 480 

CAGITAGGGA AAAACITAGT TGAACirCIT CIACAAQGAG AGATCPCAAG GTTCCITAAA 540 

ATATIGGAAA ACTGGGCnT GCITQGGATr GAGCAAGCIC CTCAAGAGGA GITAAAGCAA 600 

TirrCGGCrC QGCAAGATGG GACAGCIACC GAGAACAGAT CAGAAAAACA AGTCAAAATG 660 

AATGATTnG ATTTGAATGA TATCIATATA GACTCAGATG ACACAGAOCT CGAAAGATCI 720 

CCTCCTCCAA GGAATCCAGC GACCAGITCr CnGATEATC CITCATGGAT ACATCAGTCT 780 

AGTCOSCXnx: AGACAAGTAG GAATTCAGAT TCAGCATCTG ACCAGICACC CrCAAGITCr 840 

AGTGAAGATG CICAGATGCG CACAGGCOGG ATTGTGITCA AACTATTrOG GAAAGAGCCA 900 

AATGAAirrC CTATIGrCIT AOGAGGACAG ATIdTGACr GGITATOGCA TAGICCAACT 960 

GACATOGAGA GCTACATAAG ACCrOGCTGr ATOGTATTGA CCATCIATCT TCGTCAAGCT 1020 

GAAACIGCn GGGAAGAACI TICAGACGAT CTOGGl'l'l'lA GCITAGQGAA GCnCIAGAT 1080 

CrcrCCGATG ATCCCriGrG GACAACTGGA TGGATTTATG TAGGGTGGAG AACXIAACITG 1140 

CATTICTATA TAAOQGTCAG GTrGrCGITG ACAdTCATT GTCTCIAAAA AGTOGrGATT 1200 

ATAGTCACAT CAITAGOGIT AAACOGCITG CIATAGCTGC AAOQGAGAAG GCrCAATTTA 1260 
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CAGTTAAAQG CATGAATCTC OCTCX3GGCTG GCACAAGGIT ACTTIGITCr GTTCAAGGAA 1320 

AATACTTGAT TCAGGAAACA ACACAGGATT CGACX:=IACCAG GGAGGATGAC GATTTCAAGG 1380 

ACAACAGTGA GATTGTTGAG TGTGTAAACr TCTCTICTGA TATGCCTATA TTGACTGGTC 1440 

GAGGATTCAT GGAGATTGAA GACXZAAGGAC TCACTAGCAG C^ETCTTCCXT TIUrTACTOG 1500 

TTGAAGATGA CGATGnTGT TCTGAAATCC CTATACITGA AACCACATTA GAGITCACIG 1560 

GAACTGATTC TGCTAAGCAA GCTATOGAIT TCATACATGA AATCGGITGG CTTCITCACA 1620 

GAACTAAACr TQQQGAATCA GACXXAAATC CAGGCUlTiT CCCATTAATA COZTTCCACT 1680 

QGCTAATOGA GITCTCAATG GATC3GAGAGT GGTGaGCTCT GATCAGAAAG CTATTAAACA 1740 

TCTIXTTTTGA TOGAGCTGIT GGTGAATnT CITCXJIXXTC TAATGCCACA CTGTCAGAAC 1800 

TGaXXXriTCr TCACAGAGCX: CTGAGGAAAA ACTCTAAGCC TATGGTTGAA ATGCTCITGA 1860 

GATATATTCC CAAGCAACAG AGAAACAGCT TGTITAGACC OGATGCTGCT GGTCCAGCOG 1920 

GCITAACACC TCTTCATATT GCAGCTOGTA AAGACaGTTC AGAAGATCTG TTGGATGGGC 1980 

TAACAGAAGA TCCTGCAATG GrGGGGATIG AAGGCTGGAA GACATGTOGA GACAGCACAG 2040 

GCITCACACC AGAAGACTAC GCACGCTTAC GCGGTCACIT CTCATACATC CACITGAITC 2100 

AAOGCAAGAT CAATAAAAAG TCAACAACIG AAGATCATGI TCTOGTCAAC ATCXX^^CTTT 2160 

CirrcrCAGA CAGAGAGCAG AAAGAACCAA AATCAGGTCC GATGGCITCA GCCrrOGAGA 2220 

TCACACAGAT TCCATGCAAG CTCTCTGACC ATAAACrOGT GTATQQGACA AGAOGGAGCT 2280 

CICTAGOJrA CAGACCAGCr ATCTTGTCAA TOGrGGOGAT TGCrGCJOGIT TGOGTCICTG 2340 

TOGCACnUr GTITAAGACT TGCCXDGGAAG TGCrCEAICT GnrCAAODG TTCAGGrOGG 2400 

ACTTATTGGA CTATGGAACA AGCTGACTGT AAGTCTACIT TGAAAGATCT TCTAAGATAT 2460 

ATATATGAAT GITACTTATA TAAAACCATA GAGGTGTGAT TTCTATATGr AACTATATGA 2520 

GTATAAGATA TAGAGACATG TTOGAGAAGA AGATTGITGr TATTATTGIT GITCTrGTIG 2580 

TTGTGTAAAA GCCICTCXTTA TCICTCIXDGA ACCTAAGGAT TCIXTTCTCTG ATTAGTATAT 2640 

TITirGTITG ACAAAAAAAA AAAAAAAAAA AAAAAAAAAA AA - 2682 
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(2) INFORMATION FOR SBQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENCTH: 848 amino acids 

(B) TYPE: andno acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGIN?^ SOURCE: 

(A) ORGANISM: Arabidcpsis t±ialiana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3A52 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 8: 

Met Glu Ala Arg lie Asp Glu Gly Gly Glu Ala Gin Gin Phe Tyr Gly 
15 10 15 

Ser Val Gly Asn Ser Ser Asn Ser Ser Ser Ser Cys Ser Asp Glu Gly 
20 25 30 

Asn Asp Lys Lys Arg Arg Ala Val Ala lie Gin Gly Asp Thr Asn Gly 
35 40 45 

Ala Leu Thr Leu Asn Leu Asn Gly Glu Ser Asp Gly Leu Phe Pro Ala 
50 55 60 

Lys Lys Thr Lys Ser Gly Ala Val Cys Gin Val Glu Asn Cys Glu Ala 
65 70 75 80 

Asp Leu Ser Lys Val Lys Asp Tyr His Arg Arg His Lys Val Cys Glu 
85 90 95 

Met His Ser Lys Ala Thr Ser Ala Thr Val Gly Gly He Leu Gin Arg 
100 105 110 

Phe Cys Gin Gin Cys Ser Arg Phe His Leu Leu Pro Gly Phe Asp Asp 
115 120 125 

Gly Lys Arg Ser Cys Arg Arg A2:g Leu Ala Gly His Asn Lys Arg Pro 
130 135 140 
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Air^g Lys Thr Asn Pro Glu Pro Gly Ala Asn Gly Asn Pro Ser Asp Asp 
145 150 155 160 

His Ser Ser Asn Tyr Leu Leu lie Thr Leu Leu Lys lie Leu Ser Asn 
165 170 175 

Met His Asn His Thr Gly Asp Gin Asp Leu Met Ser His Leu Leu Lys 
180 185 190 

Ser Leu Val Ser His Ala Gly Glu Gin Leu Gly Lys Asn Leu Val Glu 
195 200 205 

Leu Leu Leu Gin Gly Arg Arg Ser Gin Gly Ser Leu Asn lie Gly Asn 
210 215 220 

Ser Ala Leu Leu Gly lie Glu Gin Ala Pro Gin Glu Glu Leu Lys Gin 
225 230 235 240 

Phe Ser Ala Arg Gin Asp Gly Thr Ala Thr Glu Asn Arg Ser Glu Lys 
245 250 255 

Gin Val Lys Met Asn Asp Phe Asp Leu Asn Asp lie Tyr lie Asp Ser 
260 265 270 

Asp Asp Thr Asp Val Glu Arg Ser Pro Pro Pro Thr Asn Pro Ala Thr 
275 280 285 

Ser Ser Leu Asp Tyr Pro Ser Trp lie His Gin Ser Ser Pro Pro Gin 
290 295 300 

Thr Ser Arg Asn Ser Asp Ser Ala Ser Asp Gin Ser Pro Ser Ser Ser 
305 310 315 320 

Ser Glu Asp Ala Gin Met Arg Thr Gly Arg lie Val Phe Lys Leu Phe 
325 330 335 

Gly Lys Glu Pro Asn Glu Phe Pro lie Val Leu Arg Gly Gin lie Leu 
340 345 350 

Asp Trp Leu Ser His Ser Pro Thr Asp Met Glu Ser Tyr lie Arg Pro 
355 360 365 

Gly Cys lie Val Leu Thr He Tyr Leu Arg Gin Ala Glu Thr Ala Trp 
370 375 380 



Glu Glu Leu Ser Asp Asp Leu Gly Phe Ser Leu Gly Lys Leu Leu Asp 
385 390 395 400 
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Leu Ser Asp Asp Pro Leu Trp Thr Thr Gly Trp lie Tyr Val Arg Val 
405 410 415 

Gin Asn Gin Leu Ala Phe Val Tyr Asn Gly Gin Val Val Val Asp Thr 
420 425 430 

Ser Leu Ser Leu Lys Ser Arg Asp Tyr Ser His lie lie Ser Val Lys 
435 440 445 

Pro Leu Ala lie Ala Ala Thr Glu Lys Ala Gin Phe Thr Val Lys Gly 
450 455 460 

Met Asn Leu Arg Arg Arg Gly Thr Arg Leu Leu Cys Ser Val Glu Gly 
465 470 475 480 

Lys Tyr Leu lie Gin Glu Thr Thr His Asp Ser Thr Thr Arg Glu Asp 
485 490 495 

Asp Asp Phe Lys Asp Asn Ser Glu lie Val Glu Cys Val Asn Phe Ser 
500 505 510 

Cys Asp Met Pro lie Leu Ser Gly Arg Gly Phe Met Glu lie Glu Asp 
515 520 525 

Gin Gly Leu Ser Ser Ser Phe Phe Pro Phe Leu Val Val Glu Asp Asp 
530 535 540 

Asp Val Cys Ser Glu lie Arg lie Leu Glu Ttir Thr Leu Glu Phe Thr 
545 550 555 560 

Gly Thr Asp Ser Ala Lys Gin Ala Met Asp Phe lie His Glu lie Gly 
565 570 575 

Trp Leu Leu His Arg Ser Lys Leu Gly Glu Ser Asp Pro Asn Pro Gly 
580 585 590 

Val Phe Pro Leu lie Arg Phe Gin Trp Leu lie Glu Phe Ser Met Asp 
595 600 605 

Arg Glu Trp Cys Ala Val lie Arg Lys Leu Leu Asn Met Phe Phe Asp 
610 615 620 

Gly Ala Val Gly Glu Phe Ser Ser Ser Ser Asn Ala Thr Leu Ser Glu 
625 630 635 640 

Leu Cys Leu Leu His Arg Ala Val Pocg Lys Asn Ser Lys Pro Met Val 
645 650 655 



Glu Met Leu Leu Arg Tyr lie Pro Lys Gin Gin Arg Asn Ser Leu Phe 
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660 665 670 

Arg Pro Asp Ala Ala Gly Pro Ala Gly Leu Thr Pro Leu His lie Ala 
675 680 685 

Ala Gly Lys Asp Gly Ser Glu Asp Val Leu Asp Ala Leu Thr Glu Asp 
690 695 700 

Pro Ala Met Val Gly lie Glu Ala Trp Lys Thr Cys Arg Asp Ser Thr 
705 710 715 720 

Gly Phe Thr Pro Glu Asp Tyr Ala Arg Leu Arg Gly His Phe Ser Tyr 
725 730 735 

lie His Leu lie Gin Arg Lys lie Asn Lys Lys Ser Thr Thr Glu Asp 
740 745 750 

His Val Val Val Asn lie Pro Val Ser Phe Ser Asp Arg Glu Gin Lys 
755 760 765 

Glu Pro Lys Ser Gly Pro Met Ma Ser Ala Leu Glu lie Thr Gin lie 
770 775 780 

Pro Cys Lys Leu Cys Asp His Lys Leu Val Tyr Gly Thr Thr Arg Arg 
785 790 795 800 

Ser Val Ala Tyr Arg Pro Ala Met Leu Ser Met Val Ala lie Ala Ala 
805 810 815 

Val Cys Val Cys Val Ala Leu Leu Phe Lys Ser Cys Pro Glu Val Leu 
820 825 830 

Tyr Val Phe Gin Pro Phe Arg Trp Glu Leu Leu Asp Tyr Gly Thr Ser 
835 840 845 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOIHEmCAL: NO 
(iii) ANTI -SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4B11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CAGCGGAAGA GCTCACOGIT GAAGAGAGGA ATCTCCTCrC TGITGCrrAC AAAAAOGTGA 60 

TOOGATCTCr ACGOGCGGCC TGGAGGATGG TCTCTTCGAT TGAGCAGAAG GAAGAGACTA 120 

GGAAGAAOGA OGAGCACCTG TGGCITGTCA AOGATTACAG ATCTAAAGTT GAGTCTGAGC 180 

TrrCrrCICT TTGCrCTGGA ATCCITAAGC TCCTTGACTC GCATCTGATC CCATCTGCIG 240 

GAGOGACTGA GTCTAAGGTC TITTACITGA AGATGAAAGG TGATTATCAT OQGTACATGG 300 

CTGAGTITAA GTCrOGIGAT GAGAGGAAAA CTGCTGCrGA AGATACCATG CrOGCITACA 360 

AAGCAGCrCA GGATATOGGA GCTGCOGATA TOGCACCTAC TCATCOGATA AGGCITGGTC 420 

TOGCCCTGAA TTTCTCAGIG TrcrACTATG AGATTCTCAA TTCTTCAGAC AAAGdTGTA 480 

ACATQGCCAA ACAGGCl'i'lT GAGGAAGCCA TAGCTGAGCT TGACACTCTG GGAGAAGAAT 540 

CCTACAAAGA CAGCACTCIC ATAATGCAGT TGCTGA 576 
(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 248 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4B11 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ala Ala Thr Leu Gly Arg Asp Gin Tyr Val Tyr Met Ala Lys Leu 
15 10 15 

Ala Glu Gin Ala Glu Arg Tyr Glu Glu Met Val Gin Phe Met Glu Gin 
20 25 30 

Leu Val Thr Gly Ala "Thr Pro Ala Glu Glu Leu Thr Val Glu Glu Arg 
35 40 45 

Asn Leu Leu Ser Val Ala Tyr Lys Asn Val lie Gly Ser Leu Arg Ala 
50 55 60 

Ala Trp Arg lie Val Ser Ser lie Glu Gin Lys Glu Glu Ser Arg Lys 
65 70 75 80 

Asn Asp Glu His Val Ser Leu Val Lys Asp Tyr Arg Ser Lys Val Glu 
85 90 95 

Ser Glu Leu Ser Ser Val cys Ser Gly lie Leu Lys Leu Leu Asp Ser 
100 105 110 

His Leu lie Pro Ser Ala Gly Ala Ser Glu Ser Lys Val Phe Tyr Leu 
115 120 125 

Lys Met Lys Gly Asp Tyr His Arg Tyr Met Ala Glu Phe Lys Ser Gly 
130 135 140 

Asp Glu Arg Lys Thr Ala Ala Glu Asp Thr Met Leu Ala Tyr Lys Ala 
145 150 155 160 

Ala Gin Asp lie Ala Ala Ala Asp Met Ala Pro Thr His Pro lie Arg 
165 170 175 

Leu Gly Leu Ala Leu Asn Phe Ser Val Phe Tyr Tyr Glu lie Leu Asn 
180 185 190 

Ser Ser Asp Lys Ala Cys Asn Met Ala Lys Gin Ala Phe Glu Glu Ala 
195 200 205 

lie Ala Glu Leu Asp Thr Leu Gly Glu Glu Ser Tyr Lys Asp Ser Thr 
210 215 220 

Leu lie Met Gin Leu Leu Arg Asp Asn Leu Ttir Leu Trp Thr Ser Asp 
225 230 235 240 
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Met Gin Glu Gin Met Asp Glu Ala 
245 

(2) INFORMATION FOR SBQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to rtlRNA 
(iii) HYPOIHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4A24 



(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 11: 

OGCOGCCACC GOGATGTAOG TGATCIACCA CCCTOGTCOG COGrOGITCr CGGTCCOGTC 60 

AATAAGAATC AGCGGGGTGA ACCTAACAAC CTCCTCIGAT TCCTCOGTCT CICATCTCTC 120 

TTCcncrrc aacttcactc taatctcaga gaatccaaac caacacctct cirrcrciTA iso 

CGATCCmC ACOGTCACOG TTAATTCAGC TAAATCOGGI AOGATGCTOG GTAAOGGAAC 240 

TGrrCCTGCI TTCrrCAGOG ATAACQGIAA CAAAACITOG TTTCAaGGOG TGATCGCIAC 300 

GTCTACAGOG GOGCGIGAGT TAGATCOGGA TGAAGCTAAG CATCTGAGAT CAGATCIGAC 360 

GOGOGOGOGT GTAGGATATG AGATCGAGAT GAGAACTAAA GTGAAGATGA TAATGGGGAA 420 

GCTGAAGAGT GAAGGAGTAG AGATCAAAGT GACATGITGA AQGATTTGAA QGAACTATAC 480 

CAAAAGGTAA AACTCCAATT GTAGCTACIT CTAAAAAAAC TAAGTGEAAG TCIGATCTTA 540 

GTGTCAAGTC TGGAAATQGA TTICTAAAQG AAITTGATAA TnCACATTG AAATTCTATA 600 

TATCTcrcrr TrrcrcroGA tttgigaaac tttggatgat caaagaattc ttcatigtc 659 

(2) INFORMATION FOR SEQ ID NO: 12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENCTH: 174 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

O (vii) IMMEDIATE SOURCE: 

^ (B) CLONE: 4A24 



1.= ^: 



i2 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Arg lie Cys Cys Cys Cys Phe Trp Ser lie Leu lie lie Leu lie Leu 
15 10 15 

Ala Leu Met Thr Ala lie Ala Ala Thr Ala Met Tyr Val lie Tyr His 
20 25 30 

Pro Arg Pro Pro Ser Phe Ser Val Pro Ser lie Arg lie Ser Arg Val 
35 40 45 

Asn Leu Thr Thr Ser Ser Asp Ser Ser Val Ser His Leu Ser Ser Phe 
50 55 60 

Phe Asn Phe Thr Leu lie Ser Glu Asn Pro Asn Gin His Leu Ser Phe 
65 70 75 80 

Ser Tyr Asp Pro Phe Thr Val Thr Val Asn Ser Ala Lys Ser Gly Thr 
85 90 95 

Met Leu Gly Asn Gly Thr Val Pro Ala Phe Phe Ser Asp Asn Gly Asn 
100 105 110 

Lys Thr Ser Phe His Gly Val lie Ala Thr Ser Thr Ala Ala Arg Glu 
115 120 125 

Leu Asp Pro Asp Glu Ala Lys His Leu Arg Ser Asp Leu Thr Arg Ala 
130 135 140 
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Arg Val Gly Tyr Glu lie Glu Met Arg Thr Lys Val Lys Met He Met 
145 150 155 160 

Gly Lys Leu Lys Ser Glu Gly Val Glu He Lys Val Thr Cys 
165 170 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 584 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
Q (iii) HYPOTHETICAL: NO 

CO (iii) ANTI -SENSE: NO 



111 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3B76 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCTCCAACTC CAGGCCAGCC AACAAAAGAA CCTACATTTA TIXXAGTGGT TGirOGTCTT 60 

TIGGACrCAA GTOGGAAAGA CATTACTCTT TCCTCTGTTC ATTATGATGG TACAGTGCAG 120 

ACCATTTCAG GGAGCAGCAC AATACTTOGA GTGACAAGAA ACAAGAAGAG TITCTGnTT 180 

CIGATATACC AGAAAGACCT GTTCOGrCCC TATTTAGGGG ATTCAGCCCC AGTTOCTGTT 240 

GAAACTGATC TCTCTAATGA TGACTTATTC TTCCTCCTAG CACATGATTC AGATGAATTC 300 

AATAGGTGGG AGGCOGGTCA AGirCTOGCA AGAAAGCTGA TGCTGAACTT AGITTCTGAT 360 

TICCAGCAAA ATAAACOGTT GGCTCTAAAC CCAAAATTTG TGCAAGGTCT CGGCAGTGIG 420 

CirrCTGACr CAAGCrrOGA CAAGGAATTT ATAGCCAAAG CAATAACACT ACCTGQGGAG 480 

GGAGAGATAA TQGACATGAT GGCOGTOGOG GATCCIGATG CTGTTCATGC TGTTAGAAAG 540 

TTIGTAOGAA AGCAGCTTGC ATCTGAACTT AAGGAGGAGC TTCr 584 
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(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 3B76 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Pro Thr Pro Gly Gin Pro Thr Lys Glu Pro Thr Phe lie Pro Val 
15 10 15 

Val Val Gly Leu Leu Asp Ser Ser Gly Lys Asp lie Thr Leu Ser Ser 
20 25 30 

Val His Tyr Asp Gly Thr Val Gin Thr lie Thr Gly Ser Ser Thr lie 
35 40 45 

Leu Arg Val Thr Lys Lys Gin Glu Glu Phe Val Phe Ser Asp lie Pro 
50 55 60 

Glu Arg Pro Val Pro Ser Leu Phe Arg Gly Phe Ser Ala Pro Val Arg 
65 70 75 80 

Val Glu Thr Asp Leu Ser Asn Asp Asp Leu Phe Phe Leu Leu Ala His 
85 90 95 

Asp Ser Asp Glu Phe Asn Arg Trp Glu Ala Gly Gin Val Leu Ala Arg 
100 105 110 

Lys Leu Met Leu Asn Leu Val Ser Asp Phe Gin Gin Asn Lys Pro Leu 
115 120 125 

Ala Leu Asn Pro Lys Phe Val Gin Gly Leu Gly Ser Val Leu Ser Asp 
130 135 140 
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Ser Ser Leu Asp Lys Glu Phe lie Ala Lys Ala lie Thr Leu Pro Gly 
145 150 155 160 

Glu Gly Glu He Met Asp Met Met Ala Val Ala Asp Pro Asp Ala Val 
165 170 175 

His Ala Val Arg Lys Phe Val Arg Lys Gin Leu Ala Ser Glu Leu Lys 
180 185 190 

Glu Glu Leu Lys He Val Glu Asn Asn Arg Ser Thr Glu Ala Tyr Val 
195 200 205 

Phe Asp His Ser Asn Met Ala Arg Arg Ala Leu Lys Asn Thr Ala Leu 
210 215 220 

Q Ala Tyr Leu Ala Ser Leu Glu Asp Pro Ala Tyr Met Gly Thr Cys Thr 

225 230 235 240 

til 

Glu Arg lie Gin Gly Gly His Gin Phe Asp Arg Pro He Cys Cys Phe 
'P 245 250 255 

^Jj Gly Thr Leu Ser Gin Asn Pro Gly Lys Thr Arg Glu Arg Thr Phe Leu 

260 265 270 

Ci 

J. Pro Asp Phe Tyr Glu Gin Val Ala Gly Thr He 

l1 275 280 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to rriRNA 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidopsis thaliana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4A5 
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(xi) SEQUENCE DESCRIPTION: SBQ ID NO: 15: 

ACCAGGAGQG GAAAAAGTCT TACCCCATQG ACATCCOOOG GATTGAGrGT TACCOGAAAA 60 

QGATGAAGAA TOCTATTCCT COGTOCTGGA CCCCATGCAC CCATTOGGAA AGCOGTCTOG 120 

CGTirrCriT CAGGGATGAT AGAAAACTGC TCCCTIGGGA TGGAAAGGAG GAGCCTITAC 180 

TGGrAGTOGC CGATAGQGIG AGGAATGTIG TOGAQGCTGA TGAOOGGTAT TATCTCGTOG 240 

TOGCTGAGAA COGACITAAG CTAGAGAAAG GATCAGATIT GAAGGOGAGA GAGGTGAAGG 300 

AGAGTITAGG GATQGTIGIT TTGGTGGTGA GGCOGCCAAG AGAAGATGAT GATGATTGGC 360 

AGACAAGTCA TCAGAACTGG GACTGAATTA ATAGAATCAA TACTCATATG CTCTAACTGA 420 

TTACGGAGTC ATCATQGTCA TGTAAAAnr ITGGATAAAG GTGGTAACTT TJTCTTCTAA 480 

GATACAATCA GAAACAGAGC AATATITITC TCTAAAAAAA AAAAAAAAAA AAAA 534 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: NO 
(iii) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arabidqpsis t±ialiana 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: 4A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Met Asp lie Pro Gly lie Glu Cys Tyr Pro Lys Arg Met Lys Asn Gly 
15 10 15 

lie Pro Pro Ser Trp Thr Pro Cys Thr His Trp Glu Ser Arg Val Ala 
20 25 30 
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Phe Ser Phe Arg Asp Asp Arg Lys Val Leu Pro Trp Asp Gly Lys Glu 
35 40 45 

Glu Pro Leu Leu Val Val Ala Asp Arg Val Arg Asn Val Val Glu Ala 
50 55 60 

Asp Asp Gly Tyr Tyr Leu Val Val Ala Glu Asn Gly Leu Lys Leu Glu 
65 70 75 80 

Lys Gly Ser Asp Leu Lys Ala Arg Glu Val Lys Glu Ser Leu Gly Met 
85 90 95 

Val Val Leu Val Val Arg Pro Pro Arg Glu Asp Asp Asp Asp Trp Gin 
100 105 110 

Thr Ser His Gin Asn Trp Asp 
115 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STE?ANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPCTHETICAL: NO 
(iii) ANTI-SENSE: NO 



(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer V6 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATGCITTGCA TAACITTGAG G 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHEmCAL: NO 
(iii) ANTI -SENSE: NO 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: primer T7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 
AATACGACTC ACIATAG 
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